Tags: language models*


  1. Ollama has partnered with NVIDIA to optimize performance on the new NVIDIA DGX Spark, powered by the GB10 Grace Blackwell Superchip, enabling fast prototyping and running of local language models.
  2. This Perspective outlines ways in which generative artificial intelligence aligns with and supports the core ideas of generative linguistics, and how generative linguistics can provide criteria to evaluate and improve neural language models.
  3. This paper surveys recent replication studies of DeepSeek-R1, focusing on Supervised Fine-Tuning (SFT) and Reinforcement Learning from Verifiable Rewards (RLVR). It details data construction, method design, and training procedures, offering insights and anticipating future research directions for reasoning language models.
  4. An introduction to evaluating language models with easy-to-understand metrics.
  5. Explains the LLM sampling hyperparameters temperature, top-k, top-p, frequency penalty, and presence penalty, with visual examples.
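
As a rough illustration of the sampling parameters named in the last entry, the sketch below applies them to a toy list of logits. It is not taken from the linked article: the function names `apply_penalties`, `filtered_probs`, and `sample_token` are hypothetical, the penalty formula follows one common formulation (subtract the frequency penalty times the token's count, plus the presence penalty if the token has appeared at all), and real inference libraries differ in the order and details of these steps.

```python
import math
import random


def apply_penalties(logits, counts, frequency_penalty=0.0, presence_penalty=0.0):
    """Lower the logits of tokens that already appeared in the output.

    counts[i] is how many times token i has been generated so far.
    This is one common formulation, assumed here for illustration.
    """
    return [
        logit - frequency_penalty * count - presence_penalty * (1.0 if count > 0 else 0.0)
        for logit, count in zip(logits, counts)
    ]


def filtered_probs(logits, temperature=1.0, top_k=0, top_p=1.0):
    """Return {token_index: probability} after temperature, top-k, and top-p."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")

    # Temperature: dividing logits by T < 1 sharpens the distribution, T > 1 flattens it.
    scaled = [logit / temperature for logit in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(s - peak) for s in scaled]
    total = sum(weights)
    probs = [w / total for w in weights]

    # Rank token indices from most to least likely.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)

    # Top-k: keep only the k most likely tokens (0 disables the filter).
    if top_k > 0:
        order = order[:top_k]

    # Top-p: keep the smallest prefix whose cumulative probability reaches top_p.
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break

    # Renormalize so the surviving probabilities sum to 1.
    kept_total = sum(probs[i] for i in kept)
    return {i: probs[i] / kept_total for i in kept}


def sample_token(logits, rng=random, **kwargs):
    """Draw one token index from the filtered distribution."""
    dist = filtered_probs(logits, **kwargs)
    indices = list(dist)
    return rng.choices(indices, weights=[dist[i] for i in indices], k=1)[0]
```

For example, `sample_token([2.0, 1.0, 0.0], temperature=0.7, top_k=2)` can only return index 0 or 1, because top-k removes the least likely token before sampling.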


SemanticScuttle - klotz.me: tagged with "language models"
